System and method for multiple viewing-window display of capsule images

ABSTRACT

Systems and methods are provided for displaying images captured from a capsule camera system. In order to increase the efficiency of viewing the image sequence, the image sequence is divided into multiple sub-sequences and the multiple sub-sequences are displayed in multiple viewing windows on a display screen concurrently. For images from a panoramic capsule system, the images typically have a very wide aspect ratio and may require a different configuration for display in multiple viewing windows than that for an image sequence having a non-wide aspect ratio. The present invention also discloses methods and systems that divide a sequence from a panoramic capsule camera into multiple member sequences and form an aggregated video. The aggregated video not only makes viewing more comfortable, but also shortens viewing time.

CROSS REFERENCE TO RELATED APPLICATIONS

The present invention is related to and claims priority to U.S. patent application Ser. No. 12/577,626, entitled “SYSTEM AND METHOD FOR DISPLAY OF PANORAMIC CAPSULE IMAGES” and filed on Oct. 12, 2009. The U.S. patent application is hereby incorporated by reference in its entirety.

FIELD OF THE INVENTION

The present invention relates to diagnostic imaging inside the human body. In particular, the present invention relates to displaying images captured by a capsule camera system using multiple viewing windows.

BACKGROUND

Devices for imaging body cavities or passages in vivo are known in the art and include endoscopes and autonomous encapsulated cameras. Endoscopes are flexible or rigid tubes that pass into the body through an orifice or surgical opening, typically into the esophagus via the mouth or into the colon via the rectum. An image is formed at the distal end using a lens and transmitted to the proximal end, outside the body, either by a lens-relay system or by a coherent fiber-optic bundle. A conceptually similar instrument might record an image electronically at the distal end, for example using a CCD or CMOS array, and transfer the image data as an electrical signal to the proximal end through a cable. Endoscopes allow a physician control over the field of view and are well-accepted diagnostic tools. However, they do have a number of limitations, present risks to the patient, are invasive and uncomfortable for the patient, and their cost restricts their application as routine health-screening tools.

Because of the difficulty traversing a convoluted passage, endoscopes cannot reach the majority of the small intestine and special techniques and precautions, that add cost, are required to reach the entirety of the colon. Endoscopic risks include the possible perforation of the bodily organs traversed and complications arising from anesthesia. Moreover, a trade-off must be made between patient pain during the procedure and the health risks and post-procedural down time associated with anesthesia. Endoscopies are necessarily inpatient services that involve a significant amount of time from clinicians and thus are costly.

An alternative in vivo image sensor that addresses many of these problems is the capsule endoscope. A camera is housed in a swallowable capsule, along with a radio transmitter for transmitting data, primarily comprising images recorded by the digital camera, to a base-station receiver or transceiver and data recorder outside the body. The capsule may also include a radio receiver for receiving instructions or other data from a base-station transmitter. Instead of radio-frequency transmission, lower-frequency electromagnetic signals may be used. Power may be supplied inductively from an external inductor to an internal inductor within the capsule or from a battery within the capsule.

An autonomous capsule camera system with on-board data storage was disclosed in the U.S. patent application Ser. No. 11/533,304, entitled “In Vivo Autonomous Camera with On-Board Data Storage or Digital Wireless Transmission in Regulatory Approved Band,” filed on Sep. 19, 2006. This application describes a capsule system using on-board storage such as semiconductor nonvolatile archival memory to store captured images. After the capsule passes from the body, it is retrieved. The capsule housing is opened and the stored images are transferred to a computer workstation for storage and analysis. For capsule images either received through wireless transmission or retrieved from on-board storage, the images will have to be displayed and examined by a diagnostician to identify potential anomalies.

Besides the above mentioned forward-looking capsule cameras, there are other types of capsule cameras that provide side view or panoramic view. A side or reverse angle is required in order to view the tissue surface properly. Conventional devices are not able to see such surfaces, since their FOV is substantially forward looking. It is important for a physician to see all areas of these organs, as polyps or other irregularities need to be thoroughly observed for an accurate diagnosis. Since conventional capsules are unable to see the hidden areas around the ridges, irregularities may be missed, and critical diagnoses of serious medical conditions may be flawed. A camera configured to capture a panoramic image of an environment surrounding the camera is disclosed in U.S. patent application Ser. No. 11/642,275, entitled “In vivo sensor with panoramic camera” and filed on Dec. 19, 2006. The panoramic camera is configured with a longitudinal field of view (FOV) defined by a range of view angles relative to a longitudinal axis of the capsule and a latitudinal field of view defined by a panoramic range of azimuth angles about the longitudinal axis such that the camera can capture a panoramic image covering substantially a 360 deg latitudinal FOV.

Conceptually, multiple individual cameras may be configured to cover completely or substantially a 360 deg latitudinal FOV. However, such a panoramic capsule system may be expensive since multiple image sensors and associated electronics may be required. A cost-effective panoramic capsule system is disclosed in U.S. patent application Ser. No. 11/624,209, entitled “Panoramic Imaging System”, filed on Jan. 17, 2007. The panoramic capsule system uses an optical system configured to combine several fields-of-view to cover a 360° view. Furthermore, the combined fields-of-view are projected onto a single sensor to save cost. Therefore, this single-sensor capsule system functions effectively as multiple cameras at a lower cost.

Similar to the situation with a conventional forward looking capsule system, for side-view and panoramic view capsule systems with digital wireless transmission or on-board storage, the captured images will be played back for analysis and examination. During playback, the diagnostician looks to find polyps or other points of interest as quickly and efficiently as possible. The playback is at a controllable frame rate and may be increased to reduce viewing time. However, if the frame rate is increased too much, the gyrations of the field of view (FOV) will make the video stream difficult to follow. At whatever frame rate, image gyration demands more cognitive effort on the diagnostician's part to follow, resulting in viewer fatigue and increased chance of missing important information in the video.

For images associated with a conventional capsule camera with a forward-looking view, a capsule camera with a side view, or a panoramic camera with a panoramic view, the images will be viewed by a diagnostician on a viewing station or a display device. Due to the large amount of image data to be examined and the cost associated with the diagnostician's time, it is desired that the video corresponding to the image data be displayed in a way that helps reduce the diagnostician's viewing time without compromising the quality and reliability of the diagnostics.

SUMMARY

The present invention provides an effective method and system for viewing an image sequence generated from a capsule camera system. In one embodiment, a method for displaying video of images from a capsule camera system is disclosed which comprises accepting images captured with the capsule camera system, generating video member sequences (i.e., sub-sequences) based on the images, composing an aggregated video comprising a plurality of the video member sequences and providing the aggregated video. The capsule images may be captured by a single capsule camera, or by a panoramic camera system combining multiple fields-of-view into a single panoramic image. The member sequences may be generated by uniformly interleaving, i.e., sub-sampling the capsule images temporally, or by dividing the capsule images into temporally consecutive sections. The aggregated video, i.e., the collection of all video member sequences, can be composed by arranging the plurality of video member sequences for display in multiple viewing windows of the display window. In another embodiment, a method for displaying video of panoramic images from a capsule camera system is disclosed which comprises the further steps of pre-processing and stitching the images. In yet another embodiment of the current invention, the method for displaying video of panoramic images comprises generating the member sequences by spatially shifting images of the original sequence cyclically. In another alternative embodiment of the present invention, the frame rate is provided for the aggregated video.
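The two ways of generating member sequences named above can be illustrated with a minimal sketch. The function names `interleave_members` and `section_members` are hypothetical and chosen only for this illustration, and the frames are represented by integer indices rather than image data. The handling of a frame count that is not a multiple of the number of member sequences is an assumption, not something stated in this disclosure.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def interleave_members(frames: Sequence[T], k: int) -> List[List[T]]:
    """Temporal sub-sampling: member j receives frames j, j+k, j+2k, ..."""
    if k < 1:
        raise ValueError("k must be at least 1")
    return [list(frames[j::k]) for j in range(k)]


def section_members(frames: Sequence[T], k: int) -> List[List[T]]:
    """Consecutive sections: member j receives the j-th contiguous block."""
    if k < 1:
        raise ValueError("k must be at least 1")
    # Ceiling division so that no frame is dropped; the last member may be
    # shorter than the others (an assumption made for this sketch).
    size = -(-len(frames) // k)
    return [list(frames[j * size:(j + 1) * size]) for j in range(k)]


frames = list(range(8))                # stand-in for 8 capsule images
print(interleave_members(frames, 4))   # [[0, 4], [1, 5], [2, 6], [3, 7]]
print(section_members(frames, 4))      # [[0, 1], [2, 3], [4, 5], [6, 7]]
```

With interleaving, the images shown together at any instant are temporally adjacent in the original sequence; with consecutive sections, each viewing window covers a different stretch of the original sequence.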

A system for displaying video of capsule images is also disclosed. The system comprises an interface module coupled to accept capsule images captured with the capsule camera system, a first processing module coupled to the interface module for accessing the capsule images and configured to generate video member sequences based on the capsule images, a second processing module coupled to the first processing module for receiving the video member sequences and configured to compose an aggregated video comprising a plurality of the video member sequences, and an output interface module coupled to receive and to provide the aggregated video. The capsule images may be captured by a single capsule camera or by a panoramic camera system combining multiple fields-of-view into a single panoramic image. The member sequences are generated by uniformly interleaving the capsule images temporally or dividing the capsule images into temporally consecutive sections. For displaying video of panoramic images, the member sequences can be further generated by spatially cyclically shifting images of the original sequence. The aggregated video, i.e., the collection of all video member sequences, can be composed by arranging the plurality of the video member sequences for display in the display window. In another alternative embodiment of the present invention, the frame rate is provided for the aggregated video.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows schematically a capsule camera system in the GI tract, where archival memory is used to store capsule images to be analyzed and/or examined.

FIG. 2A shows schematically a panoramic capsule camera system in the GI tract, where archival memory is used to store capsule images to be analyzed and/or examined.

FIG. 2B shows schematically a panoramic capsule camera system in the GI tract, where wireless transmission is used to send panoramic images to a base station for further analysis and/or examination.

FIG. 3A shows a video screen divided into a display window for displaying video and information bars for displaying information associated with the underlying capsule images.

FIG. 3B shows an exemplary multiple viewing-window configuration where 8 viewing windows are used for the capsule images.

FIG. 4A shows one exemplary arrangement of display window for video member sequences from a panoramic camera system using a reflective element having 4 sides.

FIG. 4B shows an alternative exemplary arrangement of display window for video member sequences from a panoramic camera system using a reflective element having 4 sides.

FIG. 5A shows one exemplary arrangement of display window to accommodate two video viewing windows simultaneously.

FIG. 5B shows an alternative exemplary arrangement of display window to accommodate two video viewing windows simultaneously.

FIG. 5C shows another exemplary arrangement of display window to accommodate two video viewing windows showing two images of the same instance where one is a spatially shifted version of the other.

FIG. 6A shows one exemplary arrangement of display window to accommodate three video viewing windows simultaneously.

FIG. 6B shows an exemplary alternative arrangement of display window to accommodate three video viewing windows simultaneously.

FIG. 7 shows an exemplary temporal sub-sampling method to construct member sequences from an original sequence.

FIG. 8 shows a method of constructing member sequences from an original sequence by equally dividing the original sequence into sections.

FIG. 9A shows an exemplary flowchart for a capsule system embodying the present invention.

FIG. 9B shows an exemplary flowchart for a panoramic capsule system embodying the present invention.

DETAILED DESCRIPTION OF THE INVENTION

It will be readily understood that the components of the present invention, as generally described and illustrated in the figures herein, may be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the systems and methods of the present invention, as represented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention.

Reference throughout this specification to “one embodiment,” “an embodiment,” or similar language means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment.

Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, etc. In other instances, well-known structures, or operations are not shown or described in detail to avoid obscuring aspects of the invention.

The illustrated embodiments of the invention will be best understood by reference to the drawings, wherein like parts are designated by like numerals throughout. The following description is intended only by way of example, and simply illustrates certain selected embodiments of apparatus and methods that are consistent with the invention as claimed herein.

The present invention discloses methods and systems for multiple viewing windows so that multiple sub-sequences of a capsule image sequence can be displayed concurrently to increase the efficiency of examination by a diagnostician. The sub-sequence is also called a member sequence in this disclosure. The images may be received from a capsule camera system having on-board archival memory to store the images or received from a capsule camera having a wireless transmission module. FIG. 1 shows a swallowable capsule system 110 inside body lumen 100, in accordance with one embodiment of the present invention. Lumen 100 may be, for example, the colon, small intestines, the esophagus, or the stomach. Capsule system 110 is entirely autonomous while inside the body, with all of its elements encapsulated in a capsule housing 10 that provides a moisture barrier, protecting the internal components from bodily fluids. Capsule housing 10 is transparent or at least transparent over the lens and LED areas, so as to allow light from the light-emitting diodes (LEDs) of illuminating system 12A to pass through the wall of capsule housing 10 to the lumen 100 walls, and to allow the scattered light from the lumen 100 walls to be collected and imaged within the capsule camera. Capsule housing 10 also protects lumen 100 from direct contact with foreign material inside capsule housing 10. Capsule housing 10 is provided with a shape that enables it to be swallowed easily and later to pass through the GI tract. Generally, capsule housing 10 is sterile, made of non-toxic material, and is sufficiently smooth to minimize the chance of lodging within the lumen.

As shown in FIG. 1, capsule system 110 includes illuminating system 12A and a camera that includes optical system 14A and image sensor 16. A semiconductor nonvolatile archival memory 20 may be provided to allow the images to be stored and later retrieved at a docking station outside the body, after the capsule is recovered. System 110 includes battery power supply 24 and an output port 26. Capsule system 110 may be propelled through the GI tract by peristalsis.

Illuminating system 12A may be implemented by LEDs. In FIG. 1, the LEDs are located adjacent to the camera's aperture, although other configurations are possible. The light source may also be provided, for example, behind the aperture. Other light sources, such as laser diodes, may also be used. Alternatively, white light sources or a combination of two or more narrow-wavelength-band sources may also be used. White LEDs are available that may include a blue LED or a violet LED, along with phosphorescent materials that are excited by the LED light to emit light at longer wavelengths. The portion of capsule housing 10 that allows light to pass through may be made from bio-compatible glass or polymer.

Optical system 14A, which may include multiple refractive, diffractive, or reflective lens elements, provides an image of the lumen walls on image sensor 16. Image sensor 16 may be provided by charge-coupled devices (CCD) or complementary metal-oxide-semiconductor (CMOS) type devices that convert the received light intensities into corresponding electrical signals. Image sensor 16 may have a monochromatic response or include a color filter array such that a color image may be captured (e.g. using the RGB or CYM representations). The analog signals from image sensor 16 are preferably converted into digital form to allow processing in digital form. Such conversion may be accomplished using an analog-to-digital (A/D) converter, which may be provided inside the sensor (as in the current case), or in another portion inside capsule housing 10. The A/D unit may be provided between image sensor 16 and the rest of the system. LEDs in illuminating system 12A are synchronized with the operations of image sensor 16. One function of control module 22 is to control the LEDs during image capture operation. The control module may also be responsible for other functions such as managing image capture and coordinating image retrieval.

After the capsule camera has traveled through the GI tract and exited the body, the capsule camera is retrieved and the images stored in the archival memory are read out through the output port. The received images are usually transferred to a base station for processing and for a diagnostician to examine. The accuracy as well as the efficiency of diagnostics is most important. A diagnostician is expected to examine all images and correctly identify all anomalies. In order to help the diagnostician perform the examination more efficiently without compromising the quality of examination, the received images are processed according to the present invention by displaying multiple sub-sequences of the images in multiple viewing windows concurrently. The desire to use multiple viewing windows is not restricted to the conventional capsule camera. For capsule cameras having a panoramic view, the need for efficient viewing for diagnostics also arises.

FIG. 2A shows an exemplary swallowable panoramic capsule system 202 inside body lumen 100. Lumen 100 may be, for example, the colon, small intestines, the esophagus, or the stomach. Panoramic capsule system 202 is entirely autonomous while inside the body, with all of its elements encapsulated in a capsule housing 10 that provides a moisture barrier, protecting the internal components from bodily fluids. Capsule housing 10 is transparent or at least transparent over the areas of lens and LEDs, so as to allow light from the light-emitting diodes (LEDs) of illuminating system 12 to pass through the wall of capsule housing 10 to the lumen 100 walls, and to allow the scattered light from the lumen 100 walls to be collected and imaged within the capsule.

As shown in FIG. 2A, panoramic capsule system 202 includes illuminating system 12 and a camera that includes optical system 14 and image sensor 16. A semiconductor nonvolatile archival memory 20 may be provided to allow the images to be retrieved at a docking station outside the body, after the capsule is recovered. Panoramic capsule system 202 includes battery power supply 24 and an output port 26. Panoramic capsule system 202 may be propelled through the GI tract by peristalsis.

Illuminating system 12 may be implemented by LEDs. In FIG. 1, the LEDs are located adjacent to the camera's aperture, although other configurations are possible. The light source may also be provided, for example, behind the aperture. Other light sources, such as laser diodes, may also be used. Alternatively, white light sources or a combination of two or more narrow-wavelength-band sources may also be used. White LEDs are available that may include a blue LED or a violet LED, along with phosphorescent materials that are excited by the LED light to emit light at longer wavelengths. The portion of capsule housing 10 that allows light to pass through may be made from bio-compatible glass or polymer.

Optical system 14, which may include multiple refractive, diffractive, or reflective lens elements, provides an image of the lumen walls on image sensor 16. Image sensor 16 may be provided by charge-coupled devices (CCD) or complementary metal-oxide-semiconductor (CMOS) type devices that convert the received light intensities into corresponding electrical signals. Image sensor 16 may have a monochromatic response or include a color filter array such that a color image may be captured (e.g. using the RGB or CYM representations). The analog signals from image sensor 16 are preferably converted into digital form to allow processing in digital form. Such conversion may be accomplished using an analog-to-digital (A/D) converter, which may be provided inside the sensor (as in the current case), or in another portion inside capsule housing 10. The A/D unit may be provided between image sensor 16 and the rest of the system. LEDs in illuminating system 12 are synchronized with the operations of image sensor 16. One function of control module 22 is to control the LEDs during image capture operation.

FIG. 2B shows an alternative swallowable capsule system 210. Capsule system 210 may be constructed substantially the same as panoramic capsule system 202 of FIG. 2A, except that archival memory system 20 and output port 26 are no longer required. Capsule system 210 also includes communication protocol encoder 220, transmitter 226 and antenna 228 that are used in the wireless transmission to transmit captured images to a receiving device attached to or carried by the person to whom capsule system 210 is administered. The elements of panoramic capsule system 202 and panoramic capsule system 210 that are substantially the same are therefore provided the same reference numerals. Their constructions and functions are therefore not described again here. Communication protocol encoder 220 may be implemented in software that runs on a DSP or a CPU, in hardware, or in a combination of software and hardware. Transmitter 226 and antenna 228 are used for transmitting the captured digital image.

The panoramic camera systems shown in FIG. 2A and FIG. 2B are based on a system using a pyramidal reflective element having multiple reflective side facets facing in different directions. Each of the reflective side facets is associated with a component image in its respective direction. The panoramic camera system combines the component images to form a composite image. There are also other types of panoramic camera systems. For example, in U.S. patent application Ser. No. 11/642,275, entitled “In vivo sensor with panoramic camera” and filed on Dec. 19, 2006, a panoramic camera system using a panoramic annular lens is described. The panoramic annular lens is configured to enable images to be captured by the panoramic camera radially about the longitudinal axis and onto the single image plane. The panoramic images captured by the system having a panoramic annular lens represent a continuous field of view up to 360°. On the other hand, the panoramic images captured by the system having a reflective element with multiple reflective side facets may represent multiple contiguous fields-of-view.

Though the panoramic images may correspond to a 360° view of the lumen, a practical and convenient way to view the panoramic images is on a display screen which is essentially flat. Therefore the panoramic image has to be properly placed on the flat screen for viewing. For example, the panoramic image captured by the panoramic camera system with a 4-sided reflective element has 4 component images. Each component image corresponds to an image captured in a perspective direction and each component image may slightly overlap its two neighboring component images. The 4 component images are connected in a circular fashion. Images captured by a panoramic camera having a panoramic annular lens will provide continuous fields-of-view and have no border lines within the image.

FIG. 3A shows a display screen 300 for displaying the sequence of capsule images. The display screen 300 may reserve some screen areas 320 and 330 for displaying other information such as patient information associated with the underlying image sequence and/or the location information, if available, of the current image being shown. The area 310 is designated as the display window for showing the sequence of capsule images. The image size from capsule cameras is usually much smaller than the display screen size. The display window is capable of providing multiple viewing windows to accommodate multiple sub-sequences of the image sequence concurrently without sacrificing image resolution. FIG. 3B illustrates an exemplary multiple viewing window configuration where the display contains 8 viewing windows 352-368 (arranged as 2 rows×4 columns) for concurrent display of 8 sub-sequences of images. The 8 sub-sequences are derived from the original sequence and the collection of the sub-sequences is called the aggregated video. Each viewing window is used to display one sub-sequence. However, a user may choose not to use all the viewing windows. There are various possible configurations for multiple viewing windows. Depending on the size of the display window and the size of the image, multiple viewing windows may be configured as 2×2, 2×3, 2×4, 2×6, 3×3, 3×4, 3×6, etc. The viewing windows may be connected to each other without any space in between, or may have some space in between as shown in FIG. 3B. The image sub-sequence for each viewing window may be determined independently. For example, the image corresponding to a sub-sequence in one viewing window may be flipped horizontally while the image corresponding to another sub-sequence in another viewing window may be flipped vertically. The images may also be subject to some pre-processing before they are displayed in the viewing window.
For example, the image may be trimmed to remove portions around the borders, enhanced to improve visibility, or coordinate-transformed to correct geometric distortion. When the multiple sub-sequences are displayed in the multiple viewing windows concurrently, it may require more cognitive effort to identify anomalies in the multiple sub-sequences. It may be desirable to display the multiple sub-sequences at a slower frame rate than that used to display a single sequence. For example, while a single sequence is typically displayed at 30 frames per second, the 8 sub-sequences may be displayed at a slower frame rate, such as 20 or 15 frames per second, or another frame rate. On the other hand, if there is no movement or little movement for a period of time, it is desirable to display the aggregated video at a faster frame rate during this period.
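The trade-off between the number of viewing windows and the frame rate can be illustrated with a short calculation. The following is a minimal sketch only: the function names `window_origins` and `viewing_time_seconds`, the 256×256 image size, the 8-pixel gap, and the 36,000-image sequence length are all hypothetical values chosen for illustration and are not taken from this disclosure.

```python
from typing import List, Tuple


def window_origins(rows: int, cols: int, img_w: int, img_h: int,
                   gap: int = 0) -> List[Tuple[int, int]]:
    """Top-left (x, y) of each viewing window, listed row by row."""
    return [(c * (img_w + gap), r * (img_h + gap))
            for r in range(rows) for c in range(cols)]


def viewing_time_seconds(num_images: int, windows: int, fps: float) -> float:
    """Time to show every image once when `windows` sub-sequences play concurrently."""
    if windows < 1 or fps <= 0:
        raise ValueError("windows must be >= 1 and fps must be positive")
    frames_per_window = -(-num_images // windows)  # ceiling division
    return frames_per_window / fps


# 2 rows x 4 columns of hypothetical 256x256 images with an 8-pixel gap.
origins = window_origins(2, 4, 256, 256, gap=8)
print(origins[0], origins[3], origins[4])   # (0, 0) (792, 0) (0, 264)

# Hypothetical sequence of 36,000 images.
print(viewing_time_seconds(36000, 1, 30))   # 1200.0 (single window, 30 fps)
print(viewing_time_seconds(36000, 8, 15))   # 300.0  (8 windows, 15 fps)
```

Under these assumed numbers, halving the frame rate while using 8 viewing windows still shortens the total viewing time by a factor of 4.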

For images from a panoramic capsule system, the configuration for multiple viewing windows may be different. For example, a panoramic image captured from a panoramic camera system having a 4-sided reflective element can be shown as a single image 410 in FIG. 4A. The structure of the 4-sided reflective element will result in a border line between 2 neighboring component images. The 4 component images are labeled as W 412, N 414, E 416 and S 418 corresponding to 4 directions of the 4 reflective sides. The panoramic image 410 shown in FIG. 4A is created by disjoining the component images corresponding to the W direction and the S direction. The resulting panoramic image 410 contains component images W, N, E and S from left to right. Note that while the component images are labeled with W, N, E and S directions, these 4 directions are relative directions and any of the 4 component images can be designated as the N-direction component image. Similarly, the panoramic image may be disjoined at any other border, such as the border between the N direction and the E direction, resulting in a panoramic image containing component images E, S, W, and N from left to right. For a panoramic system using a panoramic annular lens, the image will look seamless, providing a continuous field-of-view. The 360-degree panoramic image can be disjoined at any desired location. The 4 component images in the camera with the 4-sided reflective element could be stitched seamlessly by image processing technology and the image produced could also be disjoined at any desired location.
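Disjoining the circular panorama at a different location is equivalent to shifting the columns of the flattened image cyclically. The following minimal sketch illustrates this; the function name `cyclic_shift_columns` is hypothetical, and each component image is reduced to a single labeled column purely for readability.

```python
from typing import List, TypeVar

T = TypeVar("T")


def cyclic_shift_columns(image: List[List[T]], shift: int) -> List[List[T]]:
    """Shift every row left by `shift` columns, wrapping around the seam."""
    if not image or not image[0]:
        return [list(row) for row in image]
    s = shift % len(image[0])
    return [row[s:] + row[:s] for row in image]


# One row whose 4 columns stand in for component images W, N, E and S,
# i.e. the panorama disjoined between the W and S directions.
panorama = [["W", "N", "E", "S"]]

# Shifting by two component widths disjoins the panorama between N and E.
print(cyclic_shift_columns(panorama, 2))   # [['E', 'S', 'W', 'N']]
```

For a real image the shift would be given in pixels, and for a seamless panorama (annular lens or stitched components) any shift value yields a valid view.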

The panoramic image may also be displayed by placing each component image in its respective direction. For example, the 4 component images are arranged in 4 directions, each with its orientation rotated to match its perspective view, as shown in FIG. 4B. The component image 414 is placed in the north position without rotation. The component image 412 is rotated 90 degrees counterclockwise and placed in the west position while the image 416 is rotated 90 degrees clockwise and placed in the east position. The component image 418 is rotated 180 degrees and placed in the south position. The center 425 of the 4 component images represents a virtual location corresponding to the panoramic camera inside the GI tract. The 4 component images represent what the panoramic camera would see in the 4 directions. Again, the W, N, E and S directions are relative directions and any component image can be designated as the N-direction component image. As mentioned earlier, the panoramic images captured by a panoramic camera system having a panoramic annular lens do not have the border lines. However, such a panoramic image can still use the same arrangement as shown in FIG. 4B. For the arrangement of FIG. 4B, the panoramic image may be divided into 4 sub-images, which are rotated and placed in their respective positions.
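The rotations described for the arrangement of FIG. 4B can be sketched as follows. This is an illustration only: the function names are hypothetical, each component image is modeled as a small list of rows of integers, and the placement of the rotated tiles on the screen is left out.

```python
from typing import Dict, List

Image = List[List[int]]  # rows of pixel values; a stand-in for real image data


def rotate_cw(image: Image) -> Image:
    """Rotate an image 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def rotate_ccw(image: Image) -> Image:
    """Rotate an image 90 degrees counterclockwise."""
    return [list(row) for row in zip(*image)][::-1]


def rotate_180(image: Image) -> Image:
    """Rotate an image 180 degrees."""
    return [row[::-1] for row in image[::-1]]


def orient_for_compass(components: Dict[str, Image]) -> Dict[str, Image]:
    """Orient the W, N, E and S component images as described for FIG. 4B."""
    return {
        "N": [list(row) for row in components["N"]],  # no rotation
        "W": rotate_ccw(components["W"]),             # 90 degrees counterclockwise
        "E": rotate_cw(components["E"]),              # 90 degrees clockwise
        "S": rotate_180(components["S"]),             # 180 degrees
    }


tile = [[1, 2],
        [3, 4]]
oriented = orient_for_compass({name: tile for name in "WNES"})
print(oriented["W"])   # [[2, 4], [1, 3]]
print(oriented["E"])   # [[3, 1], [4, 2]]
print(oriented["S"])   # [[4, 3], [2, 1]]
```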

One of the main purposes of displaying the sequence of capsule images is for a diagnostician to analyze and examine the video to spot any possible anomaly. The factors to take into consideration in determining the display arrangement include a setup for comfortable viewing with less eye fatigue, and efficient viewing time. For both traditional colonoscopy and capsule colon endoscopy, fatigue becomes a major problem for efficacy. With the high colon cancer rate, everyone above 40-50 years old is recommended to have regular colon examinations, but there are only a limited number of doctors. For traditional colonoscopy, the detection rate drops after 3-5 procedures because each procedure requires about 30 minutes of highly technical maneuvering of the colonoscope. For capsule colon endoscopy, each reading of 10's or 100's of images per patient could easily fatigue doctors and lower the detection rate. The vast majority of the public does not comply with the recommendation for regular colon check-ups due to the invasiveness of the procedure. The capsule colon endoscope is expected to increase the compliance rate tremendously. Consequently, reducing fatigue is critical in order to serve the increased number of colonoscopy procedures. The other critical issue is cost. The doctor's time is expensive and is the major cost component of both colonoscopy procedures. If the viewing throughput can be increased, the total healthcare cost will be substantially reduced. Currently the waiting time for a colonoscopy examination appointment is several weeks, or may even be several months. With the dramatic increase in compliance rate helped by the use of capsule endoscopes, there may not be enough doctors to meet the increasing demand. Therefore, methods and systems that reduce the viewing time without compromising the detection rate take on additional importance. The panoramic image shown in FIG. 4A is an intuitive arrangement.
However, based on actual viewing experience, an image having an extremely large aspect ratio (the ratio of picture width to picture height) may often cause eye fatigue. Placing multiple panoramic images in the same display window, as shown in FIG. 5 and FIG. 6, yields a surprisingly pleasant viewing experience. In both FIG. 5 and FIG. 6, the composed images have the same picture width while the total picture height is increased. Such an arrangement effectively changes the picture aspect ratio to a lower value. The aspect ratios for FIGS. 5A-C and FIGS. 6A-B are close to that of cinema viewing. Furthermore, more images are displayed in the display window of FIGS. 5A-C and FIGS. 6A-B, which implies that a shorter viewing time is required if the video is played back at the same picture rate as before.
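
The effect of stacking on aspect ratio and viewing time is simple arithmetic, sketched below. The function names and the example dimensions used with them are assumptions for illustration only; the disclosure does not specify image sizes or frame rates.

```python
from fractions import Fraction


def stacked_aspect_ratio(width, height, num_windows):
    """Aspect ratio (width over height) of num_windows strips stacked vertically.

    The width is unchanged while the total height grows with each window.
    """
    return Fraction(width, height * num_windows)


def playback_seconds(num_images, frames_per_second, num_windows):
    """Viewing time when num_windows images are shown per displayed frame.

    Assumes each image of the sequence is shown exactly once.
    """
    aggregated_frames = -(-num_images // num_windows)  # ceiling division
    return aggregated_frames / frames_per_second
```

For a hypothetical 1280 by 160 strip, one window gives a ratio of 8, while three stacked windows give 8/3; at an unchanged frame rate, three windows cut the playback time to one third.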

The single image strip with high aspect ratio will not only cause fatigue but will also slow down the video reading speed. When a viewer views the video, the natural tendency is to focus on the middle and then look at one side, and then the other side. If some parts of the video on the left end attract the viewer's attention, the viewer still needs to look at the right end later. This dynamic tends to slow down the video viewing, and the continuous and strenuous eyeball movement will quickly fatigue the viewer.

In FIG. 5A, two panoramic images 510 and 512 are displayed on screen at the same time. These two panoramic images 510 and 512 are selected from a sequence of panoramic images for viewing. When images corresponding to an image sequence are displayed sequentially at a certain frame rate (number of frames per second), the images render themselves as a video. The display locations of the two panoramic images actually define two video viewing windows. Each of the two windows 510 and 512 can be used to display a sequence of panoramic images. Methods to create multiple sub-sequences based on a received sequence of panoramic images will be presented later. The multiple sub-sequences consist of multiple member sequences which are derived from the original sequence. While the two panoramic images shown in FIG. 5A have the same upright orientation, one of the two images may be displayed upside-down, i.e., flipped vertically as shown in FIG. 5B, where the panoramic image 514 is a vertically flipped version of image 512. Alternatively, the image 512 may stay in the same orientation while the image 510 is flipped vertically. When a capsule camera travels through the GI tract, the captured images will appear to move mainly in one direction when the images are shown sequentially on a screen. The configuration of FIG. 5A contains two video windows having images in the same orientation. When the two member sequences are played back as video, the contents in the two member sequences will appear to move in the same direction. On the other hand, the contents in the two member sequences corresponding to FIG. 5B will appear to move inward or outward from the center of the two video display windows. Whether to view the two member sequences moving in the same direction or moving inward/outward from the center can be left to the viewer's personal preference.

FIG. 5C shows an alternative arrangement for displaying multiple sub-sequences where the second member sequence consists of a shifted version of the images from the original sequence. As shown in FIG. 5C, an object happens to be located between the component images W and S. Since the panoramic image is formed by stitching, from left to right, images W, N, E and S, the object is shown as two parts 511 a and 511 b at the two ends of the panoramic image. This split image makes it hard for the diagnostician to perform the examination. Now, the same set of component images is displayed below the original panoramic image by shifting the component images 2 positions to the right to form an image of the second member sequence 516, as shown in FIG. 5C. The two parts of the split image are now joined as a complete image 511 in the member sequence 516. On the other hand, an object 513 in the original sequence 510 may now become split into two parts 513 a and 513 b in member sequence 516 if it happens to be located between component images N and E. The above example shows that the member sequence 516 is a spatially shifted version of the original sequence 510. Consequently, the images in member sequence 510 will appear in member sequence 516 in a spatially shifted fashion. In this arrangement, the member sequence 510 will have to contain all the images in the original sequence to ensure every image is displayed.
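
The two-position shift of FIG. 5C can be sketched at the level of component images. This is an illustrative sketch only: each component image is represented by its label, and the names cyclic_shift and intact_seams are hypothetical.

```python
def cyclic_shift(components, positions):
    """Cyclically shift a list of component images to the right."""
    k = positions % len(components)
    return components[-k:] + components[:-k]


def intact_seams(strip):
    """Pairs of neighboring components whose shared boundary is shown unbroken.

    The boundary between the last and first component is not included,
    because an object lying on it is split across the two ends of the strip.
    """
    return set(zip(strip, strip[1:]))


original = ["W", "N", "E", "S"]      # stitched left to right, as in the text
shifted = cyclic_shift(original, 2)  # second member sequence of FIG. 5C
```

Here shifted is E, S, W, N: the boundary between S and W, which is split in the original strip, is intact in the shifted strip, while the boundary between N and E becomes the split one. Taken together, the two strips show every boundary intact at least once.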

A first member sequence and a second member sequence may be derived from an original sequence using 2:1 temporal sub-sampling. Since neighboring images usually have high similarity, the above spatial shifting may be applied to the second member sequence, which is a temporal subset of the original sequence. In this arrangement, the total number of images in the two member sequences is the same as that of the original sequence. Since two display windows are used, the display time will be reduced to half if the display frame rate remains the same. In addition, such an arrangement provides a convenient viewing experience since non-split objects are always viewable in the center of the display.

In the case that the member sequence corresponding to image 510 is the original sequence, the second member sequence corresponding to image 516 as shown in FIG. 5C will have the same number of images as the original sequence. If the multiple sub-sequences are displayed at normal speed, it will result in the same amount of viewing time. Nevertheless, the arrangement shown in FIG. 5C still provides several advantages. First, it takes care of the split object issue. An object located between any two component images will always be shown properly in one of the member sequences. Another advantage is that a diagnostician may now focus on the left half, the right half or the center part of the aggregated video without missing any component image. For example, the 4 component images on the left half of the screen include images W, N, E and S, which are a complete set of component images. The 4 component images in the middle include images N, E, S and W, which again are a complete set of component images. Similarly, the 4 component images on the right half of the screen contain a full set of component images. Therefore, the diagnostician does not have to scan the images from side to side, which makes the viewing experience much more pleasant and relaxed. While the example in FIG. 5C shows a panorama image having 4 component images, the present invention is also applicable to continuous panorama images without any border within the image. The panorama image is considered continuous by wrapping around the two ends that connect the scene. Therefore, the panorama image is cyclically shifted by half of the image width to generate the second member sequence.
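
For a continuous panorama, the half-width cyclic shift and the resulting coverage property can be sketched as follows. The sketch assumes an image is a list of pixel rows with an even width; shift_half_width and columns_visible are hypothetical helper names.

```python
def shift_half_width(img):
    """Cyclically shift every row of a panorama by half of the image width."""
    half = len(img[0]) // 2
    return [row[-half:] + row[:-half] for row in img]


def columns_visible(top_row, bottom_row, start, stop):
    """Set of source columns shown between display columns start and stop
    when the shifted strip is stacked below the original strip."""
    return set(top_row[start:stop]) | set(bottom_row[start:stop])
```

If each pixel is labeled with its original column index, any window spanning half of the display width contains every column of the panorama once, whether the viewer looks at the left half, the center or the right half.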

Depending on the layout of the display screen and the size of the panoramic image, more than two member sequences may be displayed on the screen at the same time. For example, FIG. 6A shows three panoramic images being displayed in the same display window where all three images 610, 612, and 614 have the same orientation. FIG. 6B shows a similar arrangement having three panoramic images displayed concurrently in the same display window. However, the image 616 in the middle is a vertically inverted version of image 612. Therefore, the images 610 and 616 will look like they are joined in the middle between the two images and provide the same visual sensation as the images 510 and 514 of FIG. 5B. Accordingly, the image 610 and the image 616 will appear to move away from each other or move in toward each other depending on the image orientation and camera movement. Similarly, the images 616 and 614 will look like they are joined in the middle between the two images and provide the same visual sensation as the images 510 and 514 of FIG. 5B. In another arrangement similar to that in FIG. 6A, the orientation of the middle image 612 remains the same and both images 610 and 614 are inverted.
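
Stacking several viewing windows, with selected windows vertically inverted, can be sketched as below. The sketch assumes images are lists of pixel rows of equal width; the function name compose_frame and its inverted parameter are hypothetical.

```python
def compose_frame(windows, inverted=()):
    """Stack one image per viewing window, top to bottom.

    windows  -- list of images, each a list of pixel rows of equal width
    inverted -- indices of the windows to flip vertically before stacking
    """
    rows = []
    for index, img in enumerate(windows):
        oriented = img[::-1] if index in inverted else img
        rows.extend(list(row) for row in oriented)
    return rows
```

With three windows, an empty inverted set corresponds to the same-orientation layout of FIG. 6A, inverted={1} to the layout of FIG. 6B, and inverted={0, 2} to the alternative in which the outer two images are inverted.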

The multiple sub-sequences are derived from the original sequence. One method to generate multiple sub-sequences is to perform spatial processing on the original sequence. For example, the arrangement in FIG. 5C illustrates an example of spatial processing by cyclically shifting the original image. According to FIG. 5C, one member sequence consisting of a cyclically shifted version of the original images is generated. The cyclically shifted member sequence contains the same amount of data as the original sequence. The cyclically shifted member sequence along with the original sequence forms multiple sub-sequences. More cyclically shifted member sequences can be formed by cyclically shifting the original image by different amounts. For example, three cyclically shifted member sequences can be generated from the original sequence by cyclically shifting by 1, 2 and 3 component images respectively. Along with the original sequence, the set contains 4 member sequences. The 4 member sequences may be displayed by stacking one member sequence on top of another. The stacking order may be selected according to individual preference. For example, the original sequence may be placed on the top and the member sequences cyclically shifted by 1, 2 and 3 component images may be placed below the original sequence in order. Alternatively, the original sequence may be placed on the top; the member sequence cyclically shifted by 2 component images is placed below the original sequence, followed by the member sequences cyclically shifted by 1 component image and 3 component images. The above examples are for illustration purposes to demonstrate alternatives of spatial processing to generate member sequences for multiple sub-sequences. Other spatial processing methods to generate member sequences are also possible. The cyclical shifting method is also applicable to images having continuous scenes without borders. 
The amount of cyclic shifting may be arbitrary instead of being in units of component images. In the above example, one of the four member sequences is the original sequence and the other three are spatially shifted versions of the original sequence. Therefore the three spatially shifted member sequences have the same number of images as the original sequence. The resulting multiple sub-sequences will take the same amount of viewing time if they are displayed at a regular frame rate. Alternatively, four sub-sequences may be generated by 4:1 temporal sub-sampling of the original sequence, where the temporal sub-sampling will be described in more detail later. One of the sub-sequences can be used as a member sequence directly. The other three member sequences can be derived from the other three sub-sequences by spatially, cyclically shifting the respective sub-sequences by different spatial distances. Such multiple sub-sequences are a result of temporal processing of the original sequence followed by spatial processing.
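
The two alternatives just described, spatial shifting alone and temporal sub-sampling followed by spatial shifting, can be sketched as follows. Each image is represented as a list of labeled component images, and the names cyclic_shift, shifted_copies and subsample_then_shift are hypothetical.

```python
def cyclic_shift(strip, amount):
    """Cyclically shift one image (a list of components) to the right."""
    k = amount % len(strip)
    return strip[-k:] + strip[:-k]


def shifted_copies(frames, shifts):
    """Spatial processing only: every member sequence keeps all images."""
    return [[cyclic_shift(f, s) for f in frames] for s in shifts]


def subsample_then_shift(frames, shifts):
    """M:1 temporal sub-sampling followed by a per-sequence cyclic shift.

    M is len(shifts); member sequence k takes images k, k+M, k+2M, ...
    """
    m = len(shifts)
    return [
        [cyclic_shift(f, s) for f in frames[k::m]]
        for k, s in enumerate(shifts)
    ]
```

With shifts of 0, 1, 2 and 3 component images, shifted_copies yields four full-length member sequences and the same viewing time, while subsample_then_shift yields four member sequences that together contain each original image once.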

Other than the spatial processing discussed above, there are also temporal methods to generate member sequences. One preferred method to temporally derive multiple sub-sequences is shown in FIG. 7 where 3 sub-sequences are generated from the original sequence. The original sequence has a total of 3n images and each of the resulting sub-sequences contains n images. In the case that the total number of images in the original sequence is not divisible by 3, the last picture in the sequence may be repeated as needed to make the total number divisible by 3. The sub-sequence A contains images A₁, A₂, . . . , A_(i), . . . , A_(n), where i is the index corresponding to the temporal order that the image is displayed. Similarly, sub-sequences B and C contain images B_(i) and C_(i) respectively having index i corresponding to the temporal order that the images are displayed. The sub-sequences A, B and C may be used as member sequences directly. Therefore, at each time instance, respective images A_(i), B_(i) and C_(i) are displayed on the screen simultaneously. It is preferred that the images displayed on the screen simultaneously have the maximum similarities among them so that it is easier for the eyes to visualize and perceive the contents. Consequently, the set of respective images A_(i), B_(i) and C_(i) are chosen from consecutive images of the original sequence as shown in FIG. 7. This method of constructing the sub-sequence is often called temporal sub-sampling if the image sequence is treated as a sequence along the time domain. While the example in FIG. 7 illustrates the case having 3 member sequences, it is understood that dividing the original sequence into 3 member sequences is not a limitation of the present invention. The original sequence may be divided into any integer number of member sequences for display concurrently on the screen. 
If the sequence is divided into M member sequences and the total number of images in the original sequence is not divisible by M, the last image of the original sequence may be repeated as needed to make it divisible by M. While the sub-sequences may be used as member sequences directly, further spatial processing by cyclically shifting the sub-sequences at different spatial distances may be used to generate the member sequences.
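
The temporal sub-sampling of FIG. 7, including the repetition of the last image when the count is not divisible by M, can be sketched as follows. The function name interleave is hypothetical, and images are represented by arbitrary Python values.

```python
def interleave(frames, m):
    """Split frames into m sub-sequences by uniform temporal interleaving.

    The last image is repeated as needed so that the total is divisible
    by m; sub-sequence k then holds images k, k+m, k+2m, ...
    """
    if m < 1:
        raise ValueError("m must be a positive integer")
    if not frames:
        return [[] for _ in range(m)]
    padded = list(frames)
    while len(padded) % m:
        padded.append(padded[-1])
    return [padded[k::m] for k in range(m)]
```

With 7 images and M equal to 3, the sub-sequences are A = 1, 4, 7; B = 2, 5, 7; and C = 3, 6, 7, so that the images shown together at each time instance are consecutive images of the original sequence.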

While FIG. 7 shows temporal sub-sampling as the method for constructing multiple sub-sequences, other methods can also be used. For example, the original sequence may be equally divided into 3 sections, with the first section assigned to the member sequence A, the second section to the member sequence B and the third section to the member sequence C, as shown in FIG. 8. One advantage of this method is that, at certain instances, the capsule camera may stay relatively stationary in some sections while it travels normally in other sections. Therefore, there will be some instances where the images in some video windows show no motion or very little motion, so that a diagnostician may focus his/her attention on the images in other video windows. While the example in FIG. 8 illustrates the case where the original sequence is equally divided into 3 sections, the present invention can also be applied to cases where the original sequence is divided into another integer number of sections. As mentioned before, the collection of sub-sequences is called the aggregated video. The aggregated video for the panoramic images may require more cognitive effort for a diagnostician to identify an anomaly. Therefore it may be desirable to display the aggregated video at a slower frame rate, as mentioned before. On the other hand, if there is no movement or little movement for a period of time, it is desirable to display the aggregated video at a faster frame rate during this period.
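
The sectioning approach of FIG. 8 can be sketched in the same style. The function name consecutive_sections is hypothetical, and repeating the last image to make the sections equal in length is an assumption carried over from the temporal sub-sampling case; the text only states that the sequence is equally divided.

```python
def consecutive_sections(frames, m):
    """Divide frames into m temporally consecutive sections of equal length.

    Assumption: the last image is repeated so the total is divisible by m.
    """
    if m < 1:
        raise ValueError("m must be a positive integer")
    if not frames:
        return [[] for _ in range(m)]
    padded = list(frames)
    while len(padded) % m:
        padded.append(padded[-1])
    n = len(padded) // m
    return [padded[k * n:(k + 1) * n] for k in range(m)]
```

Unlike interleaving, the images shown together at one time instance come from different parts of the GI tract, which is why one window may show little motion while another shows normal travel.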

FIG. 9A shows a flowchart for a system embodying the present invention. At step 910, an image sequence from a capsule camera is received. The image sequence is used to generate multiple sub-sequences at step 912. The multiple sub-sequences are then used to form the aggregated video at step 914. The aggregated video is provided for display at step 916. FIG. 9B shows a flowchart for an alternative system embodying the present invention, where the images are captured by a panoramic camera. Image pre-processing such as cropping, sub-sampling and enhancement is performed at step 920. The component images from all sides of the reflective element are stitched together to form a panoramic image at step 922. For some systems, such as a system using a panoramic annular lens, the image captured is in a continuous field of view and there is no need for stitching. Therefore the step of stitching may be skipped for such systems. The collection of images forms an image sequence and the sequence is divided into M member sequences according to a method in step 912. The member sequences are then composed into an aggregated video in step 914 and the aggregated video is displayed in step 916. The stitching for an image with component images is optional. For a member sequence method similar to that described in FIG. 7, the sub-sequencing could be done in real time without receiving all the images completely.
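
The real-time possibility noted for the FIG. 7 method can be sketched with a generator that emits one aggregated frame as soon as M consecutive images have arrived. This is an illustrative sketch, not the disclosed implementation; the name stream_aggregated is hypothetical, and padding the final group by repeating the last image follows the rule stated for temporal sub-sampling.

```python
def stream_aggregated(frames, m):
    """Yield aggregated frames from an incoming image stream.

    frames -- any iterable of images, consumed one image at a time
    m      -- number of member sequences (viewing windows)

    Each yielded tuple holds the m images to display together, top to
    bottom, so receiving, sub-sequencing and composing are interleaved
    without buffering the whole sequence.
    """
    group = []
    for frame in frames:
        group.append(frame)
        if len(group) == m:
            yield tuple(group)
            group = []
    if group:
        # Pad the final, incomplete group by repeating the last image.
        group += [group[-1]] * (m - len(group))
        yield tuple(group)
```

Only M images need to be held at any time, so the aggregated video can begin playing while images are still being received.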

The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope. 

1. A method for displaying video of capsule images from a capsule camera system, the method comprising: accepting images captured with the capsule camera system; generating a plurality of video member sequences based on the images; composing an aggregated video comprising the plurality of video member sequences; and providing the aggregated video.
 2. The method of claim 1, wherein the video member sequences are generated by uniformly interleaving the images temporally.
 3. The method of claim 1, wherein the video member sequences are generated by dividing the images into temporally consecutive sections.
 4. The method of claim 1, wherein the aggregated video is provided according to a frame rate specified.
 5. A method for displaying video of panorama images, having a long edge and a short edge, from a capsule camera system, the method comprising: accepting panorama images captured with the capsule camera system; generating a plurality of video member sequences based on the panorama images; composing an aggregated video comprising the plurality of video member sequences; and providing the aggregated video.
 6. The method of claim 5, wherein the capsule camera system has multiple cameras arranged to capture a panorama view.
 7. The method of claim 5, wherein the capsule camera system has a single panorama-view camera.
 8. The method of claim 5, wherein the video member sequences are generated by uniformly interleaving the panorama images temporally.
 9. The method of claim 5, wherein the video member sequences are generated by dividing the panorama images into temporally consecutive sections.
 10. The method of claim 5, wherein the aggregated video is composed by arranging the plurality of the video member sequences along the short edge.
 11. The method of claim 5, wherein at least one of the video member sequences is a spatially shifted version of another one of the video member sequences.
 12. The method of claim 10, wherein the long edge is horizontally positioned.
 13. The method of claim 10, wherein the long edge is vertically positioned.
 14. The method of claim 5, wherein the video member sequences are generated by cyclically shifting the panorama images.
 15. The method of claim 5, wherein the video member sequences are generated by uniformly interleaving the panorama images temporally followed by cyclically shifting the interleaved images.
 16. The method of claim 5, wherein the aggregated video is provided according to a frame rate specified.
 17. A system for displaying video of images from a capsule camera system, the system comprising: an interface module coupled to accept capsule images captured with the capsule camera system; a first processing module coupled to the interface module for accessing the images and configured to generate a plurality of video member sequences based on the images; a second processing module coupled to the first processing module for receiving the plurality of video member sequences and configured to compose an aggregated video comprising the plurality of video member sequences; and an output interface module coupled to receive and to provide the aggregated video.
 18. The system of claim 17, wherein the video member sequences are generated by uniformly interleaving the images temporally.
 19. The system of claim 17, wherein the video member sequences are generated by dividing the images into temporally consecutive sections.
 20. The system of claim 17, wherein the aggregated video is provided according to a frame rate specified.
 21. A system for displaying video of panorama images, having a long edge and a short edge, from a capsule camera system, the system comprising: an interface module coupled to accept panorama images captured with the capsule camera system; a first processing module coupled to the interface module for accessing the panorama images and configured to generate a plurality of video member sequences based on the panorama images; a second processing module coupled to the first processing module for receiving the plurality of video member sequences and configured to compose an aggregated video comprising the plurality of video member sequences; and an output interface module coupled to receive and to provide the aggregated video.
 22. The system of claim 21, wherein the capsule camera system has multiple cameras arranged to capture a panorama view.
 23. The system of claim 21, wherein the capsule camera system has a single panorama-view camera.
 24. The system of claim 21, wherein the video member sequences are generated by uniformly interleaving the panorama images temporally.
 25. The system of claim 21, wherein the video member sequences are generated by dividing the panorama images into temporally consecutive sections.
 26. The system of claim 21, wherein the video member sequences are generated by cyclically shifting the panorama images.
 27. The system of claim 21, wherein the video member sequences are generated by uniformly interleaving the panorama images temporally followed by cyclically shifting the interleaved images.
 28. The system of claim 21, wherein the aggregated video is composed by arranging the plurality of the video member sequences along the short edge.
 29. The system of claim 28, wherein the long edge is horizontally positioned.
 30. The system of claim 28, wherein the long edge is vertically positioned.
 31. The system of claim 21, wherein the aggregated video is provided according to a frame rate specified.
 32. The system of claim 21, wherein the aggregated video is further processed by intensity transformation on a partial image basis. 